Borno State
How ISWAP and Boko Haram are reshaping the Lake Chad Basin
The killing of Abu-Bilal al-Minuki, the second-in-command of ISIL (ISIS), by United States and Nigerian forces marks a notable achievement for "counterterrorism". Yet for analysts observing the Lake Chad Basin, it highlights how persistent and complex insecurity in the region has become. Al-Minuki, a Nigerian national from Borno State, was operating out of a compound near Lake Chad, at the centre of one of the world's most active armed group theatres. Perhaps equally significant is the parallel resurgence of Boko Haram, which quietly rebuilt itself while security agencies primarily focused on the more dominant ISWAP. "While regional forces focused on countering ISWAP's threats, partly due to the group's advanced drone capabilities, Boko Haram appears to have taken advantage of the relative attention on its rival to regroup," Nimi Princewill, a security expert in the Sahel, told Al Jazeera.
Mining Hidden Thoughts from Texts: Evaluating Continual Pretraining with Synthetic Data for LLM Reasoning
Ishibashi, Yoichi, Yano, Taro, Oyamada, Masafumi
Large Language Models (LLMs) have demonstrated significant improvements in reasoning capabilities through supervised fine-tuning and reinforcement learning. However, when training reasoning models, these approaches are primarily applicable to specific domains such as mathematics and programming, which imposes fundamental constraints on the breadth and scalability of training data. In contrast, continual pretraining (CPT) offers the advantage of not requiring task-specific signals. Nevertheless, how to effectively synthesize training data for reasoning and how such data affect a wide range of domains remain largely unexplored. This study provides a detailed evaluation of Reasoning CPT, a form of CPT that uses synthetic data to reconstruct the hidden thought processes underlying texts, based on the premise that texts are the result of the author's thinking process. Specifically, we apply Reasoning CPT to Gemma2-9B using synthetic data with hidden thoughts derived from STEM and Law corpora, and compare it to standard CPT on the MMLU benchmark. Our analysis reveals that Reasoning CPT consistently improves performance across all evaluated domains. Notably, reasoning skills acquired in one domain transfer effectively to others; the performance gap with conventional methods widens as problem difficulty increases, with gains of up to 8 points on the most challenging problems. Furthermore, models trained with hidden thoughts learn to adjust the depth of their reasoning according to problem difficulty.
Could AI save Nigerians from devastating floods?
In the small village of Ogba-Ojibo in central Nigeria, sitting at the confluence of two of the nation's largest rivers – the Niger and Benue – 27-year-old Ako Prince Omali is counting the steps carved out of the dirt, which lead down the loam-coloured banks of the river Niger. This river bank, dotted with tufts of spiky grass, is where villagers come to fish or wash produce and laundry. Just last week, three of the steps were submerged during one night of rain, which raised the water level by about five metres. Normally, you can count seven steps down into the river. Now, only four remain above the surface of the water, the sticks bracing the muddy steps having washed away in the deluge.
A Bayesian Regression Approach for Estimating the Impact of COVID-19 on Consumer Behavior in the Restaurant Industry
The COVID-19 pandemic has had a long-term impact on industries worldwide, with the hospitality and food industry facing significant challenges, leading to the permanent closure of many restaurants and the loss of jobs. In this study, we developed an innovative analytical framework using Hamiltonian Monte Carlo for predictive modeling with Bayesian regression, aiming to estimate the change point in consumer behavior towards different types of restaurants due to COVID-19. Our approach emphasizes a novel method in computational analysis, providing insights into customer behavior changes before and after the pandemic. This research contributes to understanding the effects of COVID-19 on the restaurant industry and is valuable for restaurant owners and policymakers.
Analyzing COVID-19 Vaccination Sentiments in Nigerian Cyberspace: Insights from a Manually Annotated Twitter Dataset
Ahmad, Ibrahim Said, Aliyu, Lukman Jibril, Khalid, Abubakar Auwal, Aliyu, Saminu Muhammad, Muhammad, Shamsuddeen Hassan, Abdulmumin, Idris, Abduljalil, Bala Mairiga, Bello, Bello Shehu, Abubakar, Amina Imam
Numerous successes have been achieved in combating the COVID-19 pandemic, initially using various precautionary measures like lockdowns, social distancing, and the use of face masks. More recently, various vaccinations have been developed to aid in the prevention or reduction of the severity of the COVID-19 infection. Despite the effectiveness of the precautionary measures and the vaccines, there are several controversies that are massively shared on social media platforms like Twitter. In this paper, we explore the use of state-of-the-art transformer-based language models to study people's acceptance of vaccines in Nigeria. We developed a novel dataset by crawling multi-lingual tweets using relevant hashtags and keywords. Our analysis and visualizations revealed that most tweets expressed neutral sentiments about COVID-19 vaccines, with some individuals expressing positive views, and there was no strong preference for specific vaccine types, although Moderna received slightly more positive sentiment. We also found out that fine-tuning a pre-trained LLM with an appropriate dataset can yield competitive results, even if the LLM was not initially pre-trained on the specific language of that dataset.
Nigerian military drone attack kills 85 civilians in error
A Nigerian military attack that used drones to target rebels instead killed at least 85 civilians gathered for a religious celebration, authorities said Monday. The attack was the latest in recent errant bombings of residents in Nigeria's troubled regions; between February 2014 when a Nigerian military aircraft dropped a bomb on Daglun in Borno state killing 20 civilians and September 2022, there were at least 14 documented incidences of such bombings in residential areas. The attack on Sunday night in Tudun Biri village of Kaduna state's Igabi council area took place as Muslims gathered there to observe the holiday celebrating the birthday of the Prophet Muhammad. Kaduna Governor Uba Sani said civilians were "mistakenly killed and many others were wounded" by a drone "targeting terrorists and bandits". The National Emergency Management Agency said in a statement on Tuesday that "85 dead bodies have so far been buried while search is still ongoing".
Leveraging Closed-Access Multilingual Embedding for Automatic Sentence Alignment in Low Resource Languages
Abdulmumin, Idris, Khalid, Auwal Abubakar, Muhammad, Shamsuddeen Hassan, Ahmad, Ibrahim Said, Aliyu, Lukman Jibril, Sani, Babangida, Abduljalil, Bala Mairiga, Hassan, Sani Ahmad
The importance of qualitative parallel data in machine translation has long been determined but it has always been very difficult to obtain such in sufficient quantity for the majority of world languages, mainly because of the associated cost and also the lack of accessibility to these languages. Despite the potential for obtaining parallel datasets from online articles using automatic approaches, forensic investigations have found a lot of quality-related issues such as misalignment, and wrong language codes. In this work, we present a simple but qualitative parallel sentence aligner that carefully leveraged the closed-access Cohere multilingual embedding, a solution that ranked second in the just concluded #CoHereAIHack 2023 Challenge (see https://ai6lagos.devpost.com). The proposed approach achieved $94.96$ and $54.83$ f1 scores on FLORES and MAFAND-MT, compared to $3.64$ and $0.64$ of LASER respectively. Our method also achieved an improvement of more than 5 BLEU scores over LASER, when the resulting datasets were used with MAFAND-MT dataset to train translation models. Our code and data are available for research purposes here (https://github.com/abumafrim/Cohere-Align).